ABSTRACT

This paper deals with the problem of selecting fair multinomial populations compared with a standard. Two selection procedures are investigated: the natural selection procedure of Gupta and Leu (1989) and an empirical Bayes simultaneous selection procedure. It is proved that the natural selection procedure is a Bayes procedure relative to a symmetric Dirichlet prior distribution, and is therefore an admissible selection procedure. For the empirical Bayes simultaneous selection procedure, the associated asymptotic optimality is investigated. It is shown that the proposed empirical Bayes selection procedure is asymptotically optimal relative to a class of symmetric Dirichlet priors. The rate of convergence of the empirical Bayes selection procedure is shown to be of order $O(\exp(-\tau k + \ln k))$ for some positive constant $\tau$, where $k$ is the number of populations involved in the selection problem.


AMS  Classification:  Primary  62F07;  secondary  62C12. 


Key Words and Phrases: fair multinomial population; entropy function; Gini-Simpson index; natural selection procedure; empirical Bayes selection procedure; asymptotically optimal; rate of convergence.


* This research was supported in part by the Office of Naval Research Contract N00014-88-K-0170 and NSF Grants DMS-8606964, DMS-8702620, and DMS-8717799.




1.  INTRODUCTION 


The concept of diversity within a population is of considerable importance in statistical theory and applications. The problem of measuring diversity arises in a variety of studies in ecology, sociology, econometrics, genetics and many other sciences. For a multinomial population with $m$ cells, the index of diversity is a function of the corresponding probability parameter vector $p = (p_1, \ldots, p_m)$. In practice, a Schur-convex or Schur-concave function of $p$ may be appropriate. Two measures of diversity of a multinomial population have been commonly used: Shannon's entropy function and the Gini-Simpson index. The notion of the entropy function was introduced by Shannon (1948). The Gini-Simpson index was introduced by Gini (1912) and Simpson (1949). Both are Schur-concave functions of $p$.

In the literature, selection procedures using indices of diversity as selection criteria have been studied by many authors. Gupta and Huang (1976) studied the problem of selecting the population with the largest entropy function for binomial distributions. Gupta and Wong (1975) considered the problem of selecting a subset containing the population with the largest entropy for multinomial distributions. Dudewicz and Van der Meulen (1981) investigated a selection procedure based on a generalized entropy function. Alam, Mitra, Rizvi and Saxena (1986) studied selection procedures based on Shannon's entropy function and the Gini-Simpson index using the indifference zone approach. Rizvi, Alam and Saxena (1987) also considered a subset selection procedure based on certain other diversity indices. Recently, Gupta and Leu (1989) have studied certain selection procedures based on the Gini-Simpson index.

In this paper, we deal with the problem of selecting fair populations compared with a standard level. Consider $k$ independent multinomial populations $\pi_1, \ldots, \pi_k$. For each $i = 1, \ldots, k$, population $\pi_i$ has $m$ cells and is characterized by the corresponding probability parameter vector $p_i = (p_{i1}, \ldots, p_{im})$, where $0 \le p_{ij} \le 1$, $j = 1, \ldots, m$, and $\sum_{j=1}^{m} p_{ij} = 1$ for each $i = 1, \ldots, k$. Define

$$\theta_i = \theta(p_i) = \sum_{j=1}^{m} \left(p_{ij} - \frac{1}{m}\right)^2. \tag{1.1}$$

We use $\theta_i$ as a measure of diversity (or uniformity) of population $\pi_i$. Note that since $\theta_i = \sum_{j=1}^{m} p_{ij}^2 - \frac{1}{m}$, it is essentially equivalent to the Gini-Simpson index. Also note that $0 \le \theta_i \le 1 - \frac{1}{m}$. For a given constant $\theta_0$, $0 < \theta_0 < 1 - \frac{1}{m}$, population $\pi_i$ is said to be a fair population if $\theta_i \le \theta_0$ and a bad population otherwise. Our goal is to derive statistical selection procedures for selecting all fair populations while excluding all bad populations. It should be noted that the problem of selecting fair multinomial populations has been considered by Gupta and Leu (1989) through a classical approach.
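As a small illustration of the diversity measure in (1.1), the following sketch evaluates $\theta$ for three made-up populations with $m = 4$ cells and classifies them against a hypothetical standard level. The function names `theta` and `is_fair`, the example vectors, and the value of `theta0` are assumptions chosen for illustration only; exact rational arithmetic is used so that the boundary values $0$ and $1 - 1/m$ come out exactly.

```python
from fractions import Fraction


def theta(p):
    """Diversity measure (1.1): sum over cells of (p_j - 1/m)^2."""
    m = len(p)
    return sum((pj - Fraction(1, m)) ** 2 for pj in p)


def is_fair(p, theta0):
    """A population is fair when theta(p) <= theta0, and bad otherwise."""
    return theta(p) <= theta0


# Illustrative populations with m = 4 cells (made-up values).
uniform = [Fraction(1, 4)] * 4
skewed = [Fraction(7, 10), Fraction(1, 10), Fraction(1, 10), Fraction(1, 10)]
degenerate = [Fraction(1), Fraction(0), Fraction(0), Fraction(0)]
theta0 = Fraction(1, 10)  # hypothetical standard level

for name, p in [("uniform", uniform), ("skewed", skewed), ("degenerate", degenerate)]:
    print(name, theta(p), is_fair(p, theta0))
```

The uniform vector attains the lower bound $0$ and the degenerate vector attains the upper bound $1 - 1/m$, which matches the range stated for $\theta_i$.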

Let $\Omega = \{p = (p_1, \ldots, p_k) \mid p_i = (p_{i1}, \ldots, p_{im}),\ 0 \le p_{ij} \le 1,\ j = 1, \ldots, m,\ \sum_{j=1}^{m} p_{ij} = 1 \text{ for each } i = 1, \ldots, k\}$ be the parameter space and let $A = \{s \mid s \subset \{1, \ldots, k\}\}$ be the action space. When action $s$ is taken, population $\pi_i$ is selected as a fair population if $i \in s$ and excluded as a bad population if $i \notin s$. For $p \in \Omega$ and action $s \in A$, the loss function $L(p, s)$ is defined to be

$$L(p, s) = \sum_{i \in s} (\theta_i - \theta_0)\, I_{(\theta_0,\, 1 - \frac{1}{m}]}(\theta_i) + \sum_{i \notin s} (\theta_0 - \theta_i)\, I_{[0,\, \theta_0]}(\theta_i). \tag{1.2}$$

In (1.2), the first summation is the loss due to selecting certain bad populations and the second summation is the loss due to not selecting certain fair populations.
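The loss in (1.2) can be sketched directly in code. In the minimal example below, the function name `loss`, the 0-based population indices, and the numerical values of `thetas` and `theta0` are all assumptions made for illustration; the true $\theta_i$ are of course unknown in practice.

```python
def loss(thetas, selected, theta0):
    """Loss (1.2) for true diversity values `thetas` and selected index set `selected`."""
    total = 0.0
    for i, th in enumerate(thetas):
        if i in selected and th > theta0:
            total += th - theta0      # a bad population was selected
        elif i not in selected and th <= theta0:
            total += theta0 - th      # a fair population was excluded
    return total


thetas = [0.0, 0.27, 0.05]   # hypothetical true values of theta_1, theta_2, theta_3
theta0 = 0.1                 # hypothetical standard level

print(loss(thetas, {0, 2}, theta0))  # selects exactly the fair populations
print(loss(thetas, {0, 1}, theta0))  # selects one bad and misses one fair population
```

Selecting exactly the fair populations incurs no loss, while each wrong decision is penalized by the distance of $\theta_i$ from the standard $\theta_0$.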

The content of this paper consists of two parts. In Section 2, we investigate some optimal properties of the natural selection procedure of Gupta and Leu (1989). It is shown that, for the loss function $L(p, s)$ of (1.2), the natural selection procedure is Bayes relative to some symmetric Dirichlet prior, and therefore it is admissible. Section 3 deals with this selection problem through a parametric empirical Bayes approach. An empirical Bayes selection procedure is proposed and the corresponding asymptotic optimality is investigated. It is shown that the proposed empirical Bayes selection procedure is asymptotically optimal relative to a class of symmetric Dirichlet priors. The rate of convergence of the proposed empirical Bayes selection procedure is also established, and shown to be of order $O(\exp(-\tau k + \ln k))$ for some positive constant $\tau$, where $k$ is the number of populations involved in the selection problem.

2.  OPTIMALITY  OF  NATURAL  SELECTION  PROCEDURE 

For each $i = 1, \ldots, k$, let $X_i = (X_{i1}, \ldots, X_{im})$ be the random observation associated with population $\pi_i$, where $X_{ij}$, $1 \le j \le m$, are nonnegative integer random variables such that $0 \le X_{ij} \le N$ and $\sum_{j=1}^{m} X_{ij} = N$. Then, $X_i$ has the probability function

$$f(x_i \mid p_i) = \frac{N!}{\prod_{j=1}^{m} x_{ij}!} \prod_{j=1}^{m} p_{ij}^{x_{ij}} \tag{2.1}$$

at each point $x_i = (x_{i1}, \ldots, x_{im})$ for which $0 \le x_{ij} \le N$, $1 \le j \le m$, and $\sum_{j=1}^{m} x_{ij} = N$. Let $\mathcal{X}_i$ be the sample space generated by $X_i$. Let $X = (X_1, \ldots, X_k)$ and denote the corresponding observed value by $x = (x_1, \ldots, x_k)$. Also, let $\mathcal{X} = \mathcal{X}_1 \times \cdots \times \mathcal{X}_k$ denote the sample space of $X$.

A selection procedure $d = (d_1, \ldots, d_k)$ is defined to be a mapping from the sample space $\mathcal{X}$ into the product space $[0, 1]^k$. That is, for each $x = (x_1, \ldots, x_k) \in \mathcal{X}$, $d(x) = (d_1(x), \ldots, d_k(x))$, where $d_i(x)$ is the probability of selecting population $\pi_i$ as a fair population given that $X = x$ is observed. We let $D$ denote the class of all selection procedures defined in this way.

For each $i = 1, \ldots, k$, $(X_{i1}/N, \ldots, X_{im}/N)$ is the maximum likelihood estimator of $p_i = (p_{i1}, \ldots, p_{im})$. From (1.1), it is natural and reasonable to estimate $\theta_i$ by $\hat\theta_i = \sum_{j=1}^{m} (X_{ij}/N)^2 - \frac{1}{m}$. Gupta and Leu (1989) proposed a natural selection procedure $d^N = (d_1^N, \ldots, d_k^N)$ based on $\hat\theta_i$, $i = 1, \ldots, k$, which is equivalent to the following: For each $i = 1, \ldots, k$,

$$d_i^N(X) = \begin{cases} 1 & \text{if } \hat\theta_i \le \delta, \\ 0 & \text{otherwise,} \end{cases} \tag{2.2}$$

where $\delta$ is a prespecified positive constant such that $0 < \delta < 1 - \frac{1}{m}$. Since this natural selection procedure depends heavily on the constant $\delta$, we denote this procedure by $d^{N(\delta)}$.
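In the following, it is assumed that for each $i = 1, \ldots, k$, the parameter vector $p_i = (p_{i1}, \ldots, p_{im})$ is a realization of the random vector $P_i = (P_{i1}, \ldots, P_{im})$. It is also assumed that $P_1, \ldots, P_k$ are iid with a common prior distribution $G_\alpha$ belonging to a class of symmetric Dirichlet distributions $C$, where

$$C = \left\{ g_\alpha \,\middle|\, g_\alpha(p_i) = \frac{\Gamma(m\alpha)}{[\Gamma(\alpha)]^m} \prod_{j=1}^{m} p_{ij}^{\alpha - 1},\ 0 < p_{ij} < 1,\ j = 1, \ldots, m,\ \sum_{j=1}^{m} p_{ij} = 1,\ \alpha > 0 \right\}. \tag{2.3}$$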
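For a prior distribution $G_\alpha \in C$ and a selection procedure $d = (d_1, \ldots, d_k) \in D$, we denote the corresponding Bayes risk by $r(G_\alpha, d)$. From (1.2) and the statistical model described previously,

$$r(G_\alpha, d) = \sum_{i=1}^{k} r_i(G_\alpha, d_i), \tag{2.4}$$

where

$$r_i(G_\alpha, d_i) = \sum_{x \in \mathcal{X}} d_i(x) \left[ \sum_{j=1}^{m} E[P_{ij}^2 \mid x_i] - \frac{1}{m} - \theta_0 \right] \prod_{l=1}^{k} f(x_l) + \sum_{x \in \mathcal{X}} \int_{\Omega_i(\theta_0)} (\theta_0 - \theta_i) \prod_{l=1}^{k} [f(x_l \mid p_l)\, g_\alpha(p_l)] \, dp, \tag{2.5}$$

$$f(x_i) = \int f(x_i \mid p_i)\, g_\alpha(p_i) \, dp_i = \frac{N!}{\prod_{j=1}^{m} x_{ij}!} \cdot \frac{\Gamma(m\alpha)}{[\Gamma(\alpha)]^m} \cdot \frac{\prod_{j=1}^{m} \Gamma(\alpha + x_{ij})}{\Gamma(m\alpha + N)},$$

and $\Omega_i(\theta_0) = \{p = (p_1, \ldots, p_k) \in \Omega \mid \theta_i \le \theta_0\}$.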

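Since the second term of (2.5) does not depend on the selection procedure $d$, a Bayes selection procedure $d^{G_\alpha} = (d_1^{G_\alpha}, \ldots, d_k^{G_\alpha})$ can be obtained as follows: For each $i = 1, \ldots, k$,

$$d_i^{G_\alpha}(x) = \begin{cases} 1 & \text{if } \sum_{j=1}^{m} E[P_{ij}^2 \mid x_i] - \frac{1}{m} \le \theta_0, \\ 0 & \text{otherwise.} \end{cases} \tag{2.6}$$

Then, we have the following theorem.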

Theorem 2.1. For each positive constant $\delta$ such that $0 < \delta < 1 - \frac{1}{m}$ and $\frac{mN\delta + m - 1}{m(N + 1)} > \theta_0$, for the loss function $L(p, s)$, the natural selection procedure $d^{N(\delta)}$ given in (2.2) is a Bayes procedure relative to some symmetric Dirichlet prior distribution $G_\alpha$.


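Proof: First, straightforward computation yields that for each $i = 1, \ldots, k$ and $j = 1, \ldots, m$,

$$E[P_{ij}^2 \mid x_i] = \frac{(x_{ij} + \alpha + 1)(x_{ij} + \alpha)}{(m\alpha + N + 1)(m\alpha + N)},$$

and therefore

$$\sum_{j=1}^{m} E[P_{ij}^2 \mid x_i] - \frac{1}{m} = \frac{(m^2 - m)\alpha + mN + m \sum_{j=1}^{m} x_{ij}^2 - N^2 - N}{m(m\alpha + N + 1)(m\alpha + N)}. \tag{2.7}$$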
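Note that

$$\hat\theta_i = \sum_{j=1}^{m} \left(\frac{x_{ij}}{N}\right)^2 - \frac{1}{m} \le \delta \iff \sum_{j=1}^{m} E[P_{ij}^2 \mid x_i] - \frac{1}{m} \le H(\alpha),$$

where

$$H(\alpha) = \frac{(\delta + \frac{1}{m})\, m N^2 + (m^2 - m)\alpha + mN - N^2 - N}{m(m\alpha + N + 1)(m\alpha + N)}.$$

Thus, it suffices to prove that for given $\theta_0$, $0 < \theta_0 < 1 - \frac{1}{m}$, there exists a positive $\alpha$ such that $H(\alpha) = \theta_0$. Since $H(\alpha)$ is decreasing in $\alpha$, $H(0) = \frac{mN\delta + m - 1}{m(N + 1)} > \theta_0$ by the assumption, $\lim_{\alpha \to \infty} H(\alpha) = 0 < \theta_0$, and $H(\alpha)$ is a continuous function of $\alpha$ on $[0, \infty)$, there exists a unique $\alpha = \alpha(\theta_0) > 0$ such that $H(\alpha) = \theta_0$. This implies that the natural selection procedure $d^{N(\delta)}$ is the Bayes procedure relative to the symmetric Dirichlet prior distribution $G_{\alpha(\theta_0)}$. Hence the proof of this theorem is complete. □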

The following corollary is a direct consequence of Theorem 2.1.

Corollary 2.1. For each positive constant $\delta$ such that $0 < \delta < 1 - \frac{1}{m}$ and $\frac{mN\delta + m - 1}{m(N + 1)} > \theta_0$, the natural selection procedure $d^{N(\delta)}$ is admissible for the loss function $L(p, s)$.
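The argument behind Theorem 2.1 can be checked numerically. The sketch below, under assumed values of $m$, $N$, $\delta$ and $\theta_0$, finds the hyperparameter $\alpha(\theta_0)$ solving $H(\alpha) = \theta_0$ by bisection and compares the natural decision $\hat\theta_i \le \delta$ with the Bayes decision based on the posterior mean in (2.7). The function names, the bisection bracket `upper`, and the sample counts are hypothetical choices; the bisection relies on the monotonicity of $H$ stated in the proof.

```python
def posterior_theta(x, alpha):
    """Posterior mean of theta_i given counts x under a symmetric Dirichlet(alpha) prior, as in (2.7)."""
    m, n = len(x), sum(x)
    w = sum(xj * xj for xj in x)
    num = (m * m - m) * alpha + m * n + m * w - n * n - n
    return num / (m * (m * alpha + n + 1) * (m * alpha + n))


def H(alpha, delta, m, n):
    """Threshold H(alpha) from the proof of Theorem 2.1."""
    num = (delta + 1.0 / m) * m * n * n + (m * m - m) * alpha + m * n - n * n - n
    return num / (m * (m * alpha + n + 1) * (m * alpha + n))


def solve_alpha(theta0, delta, m, n, upper=1e6, iters=200):
    """Bisection for H(alpha) = theta0, relying on H decreasing with H(0) > theta0."""
    if not (H(0.0, delta, m, n) > theta0 > H(upper, delta, m, n)):
        raise ValueError("theta0 is not bracketed by H(0) and H(upper)")
    lo, hi = 0.0, upper
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if H(mid, delta, m, n) > theta0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


m, n = 4, 20                 # hypothetical design: 4 cells, 20 observations
delta, theta0 = 0.1, 0.05    # hypothetical constants with H(0) > theta0
alpha_star = solve_alpha(theta0, delta, m, n)

samples = [[5, 5, 5, 5], [14, 2, 2, 2], [10, 4, 4, 2], [10, 6, 2, 2]]
for x in samples:
    theta_hat = sum((xj / n) ** 2 for xj in x) - 1.0 / m
    natural = theta_hat <= delta
    bayes = posterior_theta(x, alpha_star) <= theta0
    print(x, natural, bayes)
```

For these assumed inputs the two decisions coincide on every sample, as the theorem predicts; samples lying exactly on the boundary $\hat\theta_i = \delta$ are avoided here because floating-point rounding could flip either comparison.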


3.  AN  EMPIRICAL  BAYES  SELECTION  PROCEDURE 

We  assume  that  the  hyperparameter  a  of  the  symmetric  Dirichlet  prior  distribution 
Ga  is  unknown.  In  this  situation,  it  is  not  possible  to  apply  the  Bayes  selection  procedure 
dGa  for  the  selection  problem  at  hand.  Thus,  the  empirical  Bayes  approach  is  employed 
here. 

For each $i = 1, \ldots, k$, let $W_i = \sum_{j=1}^{m} X_{ij}^2$ and let $w_i$ denote the observed value of $W_i$. Under the preceding statistical model, $W_1, \ldots, W_k$ are iid random variables such that $\frac{N^2}{m} \le W_i \le N^2$. It follows from straightforward computations that

$$\mu_2 = E[W_1] = N\left[1 + \frac{(N - 1)(\alpha + 1)}{m\alpha + 1}\right] \tag{3.1}$$

and therefore,

$$\alpha = \frac{N^2 - \mu_2}{m\mu_2 - N(m + N - 1)}. \tag{3.2}$$
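The pair (3.1) and (3.2) is a one-to-one map between the hyperparameter $\alpha$ and the moment $\mu_2$. The short sketch below checks the round trip and the two limiting values of $\mu_2$ numerically; the function names and the values $m = 4$, $N = 20$, $\alpha = 2$ are assumptions for illustration.

```python
def mu2_from_alpha(alpha, m, n):
    """Moment mu_2 = E[W_1] as a function of alpha, as in (3.1)."""
    return n * (1 + (n - 1) * (alpha + 1) / (m * alpha + 1))


def alpha_from_mu2(mu2, m, n):
    """Inverse map (3.2), recovering alpha from mu_2."""
    return (n * n - mu2) / (m * mu2 - n * (m + n - 1))


m, n = 4, 20      # hypothetical design
alpha = 2.0       # hypothetical hyperparameter

mu2 = mu2_from_alpha(alpha, m, n)
print(mu2, alpha_from_mu2(mu2, m, n))

# mu_2 decreases from N^2 (alpha -> 0) toward N + N(N-1)/m (alpha -> infinity).
print(mu2_from_alpha(0.0, m, n), mu2_from_alpha(1e9, m, n), n + n * (n - 1) / m)
```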
From (3.2) and (2.7), for each $i = 1, \ldots, k$,

$$\begin{aligned} \sum_{j=1}^{m} E[P_{ij}^2 \mid x_i] - \frac{1}{m} &= \{(m^2 - m)(N^2 - \mu_2) + [m\mu_2 - N(m + N - 1)][mN + m w_i - N^2 - N]\} \\ &\quad \times (m\mu_2 - mN - N^2 + N) / [m(N - 1)(m\mu_2 - N^2) N (m\mu_2 - N^2 - m + 1)] \\ &\equiv Q_i(\mu_2 \mid w_i), \end{aligned} \tag{3.3}$$

where $w_i = \sum_{j=1}^{m} x_{ij}^2$.
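The substitution leading to (3.3) can be verified numerically: evaluating (2.7) at a given $\alpha$ should agree with $Q_i(\mu_2 \mid w_i)$ evaluated at the $\mu_2$ that (3.1) assigns to that $\alpha$. The following sketch performs this comparison for a few assumed values; all function names and numerical inputs are hypothetical.

```python
def mu2_from_alpha(alpha, m, n):
    """Moment mu_2 as a function of alpha, as in (3.1)."""
    return n * (1 + (n - 1) * (alpha + 1) / (m * alpha + 1))


def posterior_theta_w(w, alpha, m, n):
    """Right-hand side of (2.7), written in terms of w = sum_j x_j^2."""
    num = (m * m - m) * alpha + m * n + m * w - n * n - n
    return num / (m * (m * alpha + n + 1) * (m * alpha + n))


def Q(mu2, w, m, n):
    """Q(mu_2 | w) as in (3.3)."""
    d = m * mu2 - n * (m + n - 1)  # equals m*mu2 - m*n - n^2 + n
    brace = (m * m - m) * (n * n - mu2) + d * (m * n + m * w - n * n - n)
    denom = m * (n - 1) * (m * mu2 - n * n) * n * (m * mu2 - n * n - m + 1)
    return brace * d / denom


m, n = 4, 20  # hypothetical design
for alpha in (0.5, 2.0, 10.0):
    for w in (100, 136, 208):
        direct = posterior_theta_w(w, alpha, m, n)
        via_mu2 = Q(mu2_from_alpha(alpha, m, n), w, m, n)
        print(alpha, w, direct, via_mu2)
```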


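Note that for $w_i$ (or $x_i$) being kept fixed, $Q_i(\mu_2 \mid w_i)$ is increasing in $\mu_2$.

Define $\hat\mu_2 = \frac{1}{k} \sum_{i=1}^{k} W_i$. We will use $\hat\mu_2$ to estimate $\mu_2$ and use $Q_i(\hat\mu_2 \mid w_i)$ to estimate $Q_i(\mu_2 \mid w_i)$. However, by the definition of $\mu_2$ (see (3.1)), $N + \frac{N(N-1)}{m} < \mu_2 < N^2$ and $Q_i(\mu_2 \mid w_i)$ tends to zero as $\mu_2$ tends to $N + \frac{N(N-1)}{m}$. Also, it is possible that $\hat\mu_2 \le N + \frac{N(N-1)}{m}$. Thus, we define

$$\varphi_i^*(w_i) = \begin{cases} Q_i(\hat\mu_2 \mid w_i) & \text{if } \hat\mu_2 > N + \frac{N(N-1)}{m}, \\ 0 & \text{otherwise.} \end{cases} \tag{3.4}$$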


We now propose an empirical Bayes selection procedure $d^* = (d_1^*, \ldots, d_k^*)$ as follows: For each $i = 1, \ldots, k$,

$$d_i^*(x) = \begin{cases} 1 & \text{if } \varphi_i^*(w_i) \le \theta_0, \\ 0 & \text{otherwise.} \end{cases} \tag{3.5}$$
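Putting (3.3) through (3.5) together, the empirical Bayes procedure can be sketched as follows. This is a minimal illustration under assumed data, not a reference implementation: the function names, the count vectors, and the value of `theta0` are hypothetical, and every sample is assumed to have the same number of cells $m$ and the same total $N$.

```python
def Q(mu2, w, m, n):
    """Q(mu_2 | w) as in (3.3)."""
    d = m * mu2 - n * (m + n - 1)
    brace = (m * m - m) * (n * n - mu2) + d * (m * n + m * w - n * n - n)
    denom = m * (n - 1) * (m * mu2 - n * n) * n * (m * mu2 - n * n - m + 1)
    return brace * d / denom


def empirical_bayes_selection(samples, theta0):
    """Empirical Bayes procedure (3.4)-(3.5); returns a list of 0/1 selection decisions."""
    m, n, k = len(samples[0]), sum(samples[0]), len(samples)
    ws = [sum(xj * xj for xj in x) for x in samples]
    mu2_hat = sum(ws) / k                     # moment estimate of mu_2
    lower = n + n * (n - 1) / m               # lower end of the range of mu_2
    if mu2_hat > lower:
        phis = [Q(mu2_hat, w, m, n) for w in ws]
    else:
        phis = [0.0] * k                      # fallback branch of (3.4)
    return [1 if phi <= theta0 else 0 for phi in phis]


# Hypothetical data: k = 4 populations, m = 4 cells, N = 20 observations each.
samples = [[5, 5, 5, 5], [14, 2, 2, 2], [8, 4, 4, 4], [6, 6, 4, 4]]
theta0 = 0.05  # hypothetical standard level

print(empirical_bayes_selection(samples, theta0))
```

Because $\hat\mu_2$ pools information across all $k$ samples, the estimate for each population is shrunk toward uniformity relative to the plug-in estimate $\hat\theta_i$; the strength of that shrinkage is driven by the implied estimate of $\alpha$.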


In  the  following,  we  will  investigate  the  asymptotic  performance  of  the  empirical  Bayes 
selection  procedure  d*  for  the  case  where  k ,  the  number  of  populations  involved  in  the 
selection  problem  under  study,  is  sufficiently  large. 


Since $d^{G_\alpha} = (d_1^{G_\alpha}, \ldots, d_k^{G_\alpha})$ is the Bayes selection procedure, for the empirical Bayes selection procedure $d^* = (d_1^*, \ldots, d_k^*)$, $r_i(G_\alpha, d_i^*) - r_i(G_\alpha, d_i^{G_\alpha}) \ge 0$ for each $i = 1, \ldots, k$, and therefore $r(G_\alpha, d^*) - r(G_\alpha, d^{G_\alpha}) = \sum_{i=1}^{k} [r_i(G_\alpha, d_i^*) - r_i(G_\alpha, d_i^{G_\alpha})] \ge 0$. This nonnegative regret $r(G_\alpha, d^*) - r(G_\alpha, d^{G_\alpha})$ will be used as a measure of performance of the empirical Bayes selection procedure $d^*$.


Definition 3.1. A selection procedure $d = (d_1, \ldots, d_k) \in D$ is said to be asymptotically optimal of order $\{\beta_k\}$ relative to the prior distribution $G$ if

$$r(G, d) - r(G, d^G) = O(\beta_k),$$

where $\{\beta_k\}$ is a sequence of positive numbers such that $\lim_{k \to \infty} \beta_k = 0$.

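For each $i = 1, \ldots, k$ and for fixed $\mu_2$, $Q_i(\mu_2 \mid w_i)$, which is defined in (3.3), can be viewed as a function of $w_i$. It is clear that $Q_i(\mu_2 \mid w_i)$ is increasing in $w_i$. Let

$$A_i = \Big\{ w_i \,\Big|\, w_i = w_i(x_i) = \sum_{j=1}^{m} x_{ij}^2,\ x_i \in \mathcal{X}_i,\ Q_i(\mu_2 \mid w_i) \le \theta_0 \Big\},$$

$$B_i = \Big\{ w_i \,\Big|\, w_i = w_i(x_i) = \sum_{j=1}^{m} x_{ij}^2,\ x_i \in \mathcal{X}_i,\ Q_i(\mu_2 \mid w_i) > \theta_0 \Big\}.$$

From the statistical model under consideration, $Q_i(\mu_2 \mid w) = Q_j(\mu_2 \mid w) = Q(\mu_2 \mid w)$ (say) for all $i, j = 1, \ldots, k$. Thus, $A_1 = \cdots = A_k = A$ and $B_1 = \cdots = B_k = B$ (say).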

Let $h(w)$ denote the common marginal probability function of the iid random variables $W_i = \sum_{j=1}^{m} X_{ij}^2$, $i = 1, \ldots, k$. From (2.5)-(2.6) and (3.5), straightforward computation yields that

$$\begin{aligned} 0 &\le r_i(G_\alpha, d_i^*) - r_i(G_\alpha, d_i^{G_\alpha}) \\ &\le \sum_{w_i \in A} P_i\{\varphi_i^*(w_i) > \theta_0 \mid W_i = w_i\}\, h(w_i) + \sum_{w_i \in B} P_i\{\varphi_i^*(w_i) \le \theta_0 \mid W_i = w_i\}\, h(w_i), \end{aligned} \tag{3.6}$$

where the probability measure $P_i$ is computed with respect to $(W_1, \ldots, W_{i-1}, W_{i+1}, \ldots, W_k)$.

Thus, it suffices to investigate the asymptotic behavior of $P_i\{\varphi_i^*(w_i) \le \theta_0 \mid W_i = w_i\}$ for $w_i \in B$ and $P_i\{\varphi_i^*(w_i) > \theta_0 \mid W_i = w_i\}$ for $w_i \in A$.


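Lemma 3.1. For each $c > 0$ and for sufficiently large $k$,

(a) $P_i\{\hat\mu_2 - \mu_2 < -c \mid W_i = w_i\} \le O(\exp(-k c^2 N^{-4} (1 - \frac{1}{m})^{-2} / 2))$.

(b) $P_i\{\hat\mu_2 - \mu_2 > c \mid W_i = w_i\} \le O(\exp(-k c^2 N^{-4} (1 - \frac{1}{m})^{-2} / 2))$.

(c) $P_i\{\hat\mu_2 \le N + \frac{N(N-1)}{m} \mid W_i = w_i\} \le O(\exp(-k (\mu_2 - N - (N-1)N/m)^2 N^{-4} (1 - \frac{1}{m})^{-2} / 2))$.

Note that the above upper bounds are independent of the value $w_i$.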
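Proof: (a). Let $\hat\mu_2(i) = \frac{1}{k-1} \sum_{j \ne i} W_j$. Then,

$$P_i\{\hat\mu_2 - \mu_2 < -c \mid W_i = w_i\} = P_i\Big\{\hat\mu_2(i) - \mu_2 < -\frac{kc}{k-1} + \frac{\mu_2 - w_i}{k-1}\Big\}.$$

Note that $\frac{N^2}{m} \le w_i \le N^2$ for all $w_i = w_i(x_i) = \sum_{j=1}^{m} x_{ij}^2$. Thus, for $k$ sufficiently large, $-\frac{kc}{k-1} + \frac{\mu_2 - w_i}{k-1} < -\frac{c}{2}$. Hence, we obtain, for $k$ sufficiently large, that

$$\begin{aligned} P_i\{\hat\mu_2 - \mu_2 < -c \mid W_i = w_i\} &\le P_i\Big\{\hat\mu_2(i) - \mu_2 < -\frac{c}{2}\Big\} \\ &\le \exp\Big(-(k-1) c^2 N^{-4} \Big(1 - \frac{1}{m}\Big)^{-2} / 2\Big) = O\Big(\exp\Big(-k c^2 N^{-4} \Big(1 - \frac{1}{m}\Big)^{-2} / 2\Big)\Big), \end{aligned}$$

where the last inequality follows from Theorem 2 of Hoeffding (1963), applied to the $k - 1$ iid variables $W_j$, $j \ne i$, each of which takes values in an interval of length $N^2(1 - \frac{1}{m})$. Note that the upper bound is independent of $w_i$.

The proof of part (b) is similar to that of part (a). For part (c), let $c = \mu_2 - [N + \frac{N(N-1)}{m}]$; then $c > 0$, and the result of part (c) follows directly from part (a). □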
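To get a feel for the exponential term in Lemma 3.1, the sketch below evaluates $\exp(-k c^2 N^{-4}(1 - 1/m)^{-2}/2)$ and the smallest $k$ that drives it below a tolerance. Since the lemma states its bounds only up to a constant factor, this is the leading exponential term rather than an exact probability bound. The function names and the values of $m$, $N$, $c$ and `eps` are assumptions for illustration.

```python
import math


def lemma31_exponential(k, c, m, n):
    """Exponential term exp(-k c^2 N^-4 (1 - 1/m)^-2 / 2) from Lemma 3.1."""
    return math.exp(-k * c ** 2 / (2 * n ** 4 * (1 - 1 / m) ** 2))


def min_populations(eps, c, m, n):
    """Smallest integer k for which the exponential term is at most eps (0 < eps < 1)."""
    return math.ceil(2 * math.log(1 / eps) * n ** 4 * (1 - 1 / m) ** 2 / c ** 2)


m, n = 4, 20   # hypothetical design
c = 10.0       # hypothetical deviation of the moment estimate from mu_2

for k in (100, 1800, 10000):
    print(k, lemma31_exponential(k, c, m, n))

print(min_populations(0.01, c, m, n))
```

The factor $N^{-4}(1 - 1/m)^{-2}$ reflects the squared length of the range $[N^2/m, N^2]$ of each $W_i$, so the decay in $k$ is slow when $N$ is large relative to the deviation $c$.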

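Lemma 3.2. For $w_i \in A$,

$$P_i\{\varphi_i^*(w_i) > \theta_0 \mid W_i = w_i\} \le O\Big(\exp\Big(-k (Q_i^{-1}(\theta_0 \mid w_i) - \mu_2)^2 N^{-4} \Big(1 - \frac{1}{m}\Big)^{-2} / 2\Big)\Big),$$

where $Q_i^{-1}(\cdot \mid w_i)$ is the inverse function of $Q_i(\cdot \mid w_i)$.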

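Proof: From (3.4) and the fact that $\theta_0 > 0$,

$$P_i\{\varphi_i^*(w_i) > \theta_0 \mid W_i = w_i\} = P_i\{Q_i(\hat\mu_2 \mid w_i) > \theta_0 \mid W_i = w_i\}. \tag{3.7}$$

Now, for each fixed $w_i \in A$, $Q_i(\mu \mid w_i)$ is strictly increasing in $\mu$ for $N + \frac{N(N-1)}{m} < \mu < N^2$, and $Q_i(\mu_2 \mid w_i) \le \theta_0$. Thus $\mu_2 \le Q_i^{-1}(\theta_0 \mid w_i)$. Then,

$$Q_i(\hat\mu_2 \mid w_i) > \theta_0 \iff \hat\mu_2 > Q_i^{-1}(\theta_0 \mid w_i) \iff \hat\mu_2 - \mu_2 > Q_i^{-1}(\theta_0 \mid w_i) - \mu_2 \ge 0. \tag{3.8}$$

Combining (3.7) and (3.8), by Lemma 3.1 (b), we obtain, for $w_i \in A$,

$$\begin{aligned} P_i\{\varphi_i^*(w_i) > \theta_0 \mid W_i = w_i\} &= P_i\{\hat\mu_2 - \mu_2 > Q_i^{-1}(\theta_0 \mid w_i) - \mu_2 \mid W_i = w_i\} \\ &\le O\Big(\exp\Big(-k (Q_i^{-1}(\theta_0 \mid w_i) - \mu_2)^2 N^{-4} \Big(1 - \frac{1}{m}\Big)^{-2} / 2\Big)\Big). \end{aligned}$$

Thus, the proof of this lemma is complete. □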

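Lemma 3.3. For $w_i \in B$,

$$\begin{aligned} P_i\{\varphi_i^*(w_i) \le \theta_0 \mid W_i = w_i\} &\le O\Big(\exp\Big(-k (Q_i^{-1}(\theta_0 \mid w_i) - \mu_2)^2 N^{-4} \Big(1 - \frac{1}{m}\Big)^{-2} / 2\Big)\Big) \\ &\quad + O\Big(\exp\Big(-k (\mu_2 - N - (N-1)N/m)^2 N^{-4} \Big(1 - \frac{1}{m}\Big)^{-2} / 2\Big)\Big). \end{aligned}$$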


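Proof:

$$\begin{aligned} P_i\{\varphi_i^*(w_i) \le \theta_0 \mid W_i = w_i\} &= P_i\Big\{\varphi_i^*(w_i) \le \theta_0,\ \hat\mu_2 \le N + \frac{N(N-1)}{m} \,\Big|\, W_i = w_i\Big\} \\ &\quad + P_i\Big\{\varphi_i^*(w_i) \le \theta_0,\ \hat\mu_2 > N + \frac{N(N-1)}{m} \,\Big|\, W_i = w_i\Big\}. \end{aligned} \tag{3.9}$$

From Lemma 3.1 (c),

$$P_i\Big\{\varphi_i^*(w_i) \le \theta_0,\ \hat\mu_2 \le N + \frac{N(N-1)}{m} \,\Big|\, W_i = w_i\Big\} \le O\Big(\exp\Big(-k (\mu_2 - N - (N-1)N/m)^2 N^{-4} \Big(1 - \frac{1}{m}\Big)^{-2} / 2\Big)\Big). \tag{3.10}$$

From (3.4) and an argument analogous to that given in the proof of Lemma 3.2, we have

$$\begin{aligned} P_i\Big\{\varphi_i^*(w_i) \le \theta_0,\ \hat\mu_2 > N + \frac{N(N-1)}{m} \,\Big|\, W_i = w_i\Big\} &= P_i\Big\{Q_i(\hat\mu_2 \mid w_i) \le \theta_0,\ \hat\mu_2 > N + \frac{N(N-1)}{m} \,\Big|\, W_i = w_i\Big\} \\ &\le P_i\{\hat\mu_2 - \mu_2 \le Q_i^{-1}(\theta_0 \mid w_i) - \mu_2 \mid W_i = w_i\} \\ &= O\Big(\exp\Big(-k (Q_i^{-1}(\theta_0 \mid w_i) - \mu_2)^2 N^{-4} \Big(1 - \frac{1}{m}\Big)^{-2} / 2\Big)\Big), \end{aligned} \tag{3.11}$$

where $Q_i^{-1}(\theta_0 \mid w_i) - \mu_2 < 0$ since $w_i \in B$. Thus, the lemma follows from (3.9)-(3.11). □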

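Let $\tau_1 = \min\{(Q_i^{-1}(\theta_0 \mid w_i) - \mu_2)^2 N^{-4} (1 - \frac{1}{m})^{-2} / 2 \mid w_i \in A_i \cup B_i\}$. Then, $\tau_1 > 0$ since $Q_i^{-1}(\theta_0 \mid w_i) - \mu_2 \ne 0$ for all $w_i \in A_i \cup B_i$ and $A_i \cup B_i$ is a finite set. Then $\tau = \min(\tau_1, (\mu_2 - N - (N-1)N/m)^2 N^{-4} (1 - \frac{1}{m})^{-2} / 2) > 0$.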

The following theorem describes the asymptotic optimality property of the empirical Bayes selection procedure $d^* = (d_1^*, \ldots, d_k^*)$.

Theorem 3.1. Let $d^* = (d_1^*, \ldots, d_k^*)$ be the empirical Bayes selection procedure defined through (3.4)-(3.5). Suppose that the prior distribution is a member of the class $C$ of symmetric Dirichlet distributions given in (2.3). Then

(a) For each $i = 1, \ldots, k$, $r_i(G_\alpha, d_i^*) - r_i(G_\alpha, d_i^{G_\alpha}) \le O(\exp(-\tau k))$, and

(b) $r(G_\alpha, d^*) - r(G_\alpha, d^{G_\alpha}) \le O(\exp(-\tau k + \ln k))$,

where $\tau$ is the positive constant defined previously.

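Proof: Part (b) is a direct result of part (a), since the overall regret is the sum of the $k$ componentwise regrets and $k \exp(-\tau k) = \exp(-\tau k + \ln k)$. Thus, we need to prove part (a) only. From (3.6) and Lemmas 3.2 and 3.3,

$$r_i(G_\alpha, d_i^*) - r_i(G_\alpha, d_i^{G_\alpha}) \le O(\exp(-\tau k)) \sum_{w_i \in A \cup B} h(w_i) = O(\exp(-\tau k)).$$

Thus, the theorem follows. □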